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Abstract 

Several, apparently distinct, formalisms exist in the literature for predicting the clustering of dark 
matter halos. It has been noticed on a case- by-case basis that the predictions of these different 
methods agree in specific examples, but there is no general proof that they are equivalent. In 
this paper, we give a simple proof of the mathematical equivalence of barrier crossing, peak- 
background split, and local biasing. 
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1 Introduction 



The large-scale clustering of dark matter halos has become an important probe of primordial 
cosmology. In particular, non-Gaussianity in the initial conditions would leave an imprint in the 
scale-dependence of the halo bias [1, 2], sometimes of stochastic type [3, 4]. Several, apparently 
distinct, methods are commonly used to compute these effects. So far, these methods have been 
considered to be independent, even though they give the same results when applied to specific 
examples [4, 5]. In this paper, we will show that the barrier crossing (BC) model, the peak- 
background split (PBS) method and the local biasing (LB) approach are, in fact, mathematically 
equivalent. 

Barrier crossing is the classic model of structure formation dating back to the pioneering 
work of Press and Schechter [6]. In its simplest formulation, it identifies halos as regions of the 
linearly evolved density field above some critical density 5 C . The clustering properties of halos 
can then be calculated as an Edgeworth expansion in the cumulants of the probability density of 
the primordial density fluctuations, which in turn can be expressed in terms of iV-point functions 
of the potential [4, 5, 7, 8]. 

Peak-background split is a method for calculating the influence of long- wavelength fluctuations 
(larger than the halo size) on the locally measured statistical properties. It has been widely used 
in cosmology [9, 10] and its usefulness in dealing with non-Gaussian initial conditions has been 
first pointed out in [1]. In the most common implementation, the non-Gaussian field is defined 
as a non-linear function of auxiliary Gaussian fields, which are split into short-wavelength and 
long-wavelength components. By modulating the statistics of the short modes, the long modes 
affect the clustering statistics. In this paper, we will generalize the PBS approach so that it 
can be applied to arbitrary non-Gaussian initial conditions, parametrized by arbitrary iV-point 
functions of the primordial potential. This will require introducing additional fields p2, P3, ■ • • , 
which measure the local power spectrum amplitude, skewness, etc. 

Local biasing [11-15] refers to the idea of expressing the halo density field Sh as a function 
the local dark matter density (smoothed on some scale) and expanding in powers of the density 
contrast 6, 

5 h (x) = hS(x) + b 2 5 2 {x) + b 3 5 3 (x) + ■■■ . (1.1) 

Correlation functions can then be computed straightforwardly in terms of the coefficients in 
the expansion. Several variations of this formalism exist in literature (for example some use an 
expansion in the non- linear dark matter density, while others use the linearly evolved density) . In 
this work, we will demonstrate the equivalence between barrier crossing and a particular variant 
of local biasing, in which the expansion is in the linearly evolved and non- Gaussian dark matter 
density contrast. 

In a companion paper [4], we derived the clustering statistics for specific non-Gaussian models, 
both in the peak-background split formalism and in the barrier crossing model. We showed for 
each example that both approaches give consistent results. The goal of this paper is to prove that 
this agreement isn't accidental, but follows from the mathematical equivalence of both methods. 
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The outline of the paper is as follows. After defining our notation in Section 2, we introduce 
our main technical tool in Section 3: a series expansion for the halo field 5^ in the barrier crossing 
model. We review some examples of non-Gaussian models and show how the series expansion 
is used for efficiently calculating halo power spectra. In Section 4, we use the series expansion 
to prove the mathematical equivalence of the barrier crossing model, the peak-background split 
method, and the local biasing formalism. We conclude with brief comments in Section 5. An 
appendix collects some elementary properties of Hermite polynomials. 



2 Preliminaries and Notation 

Non-Gaussian initial conditions can be parameterized by the connected iV-point functions £^ 
of the primordial gravitational potential In Fourier space, these are defined as 

($k 1 $k 2 ---$k N )c = (2n) 3 5 B (k 12 ... N )tii N) (k 1 ,k 2 ,...,k N ) , (2.1) 

where k\2,..N = k\ + k 2 + ■ ■ ■ + k^. The primordial potential is related to the linearly evolved 
matter density contrast via Poisson's equation 

5 h (z) = a(k,z)$ k , (2.2) 

where 



<*(k,z) = » V Vrs • ( 2 - 3 ) 



2k 2 T(k)D(z) 

Here, T(k) is the matter transfer function normalized such that T(k) — )■ 1 as k — > and D(z) is 
the linear growth factor (as function of redshift z), normalized so that D(z) = (1 + z)^ 1 in matter 
domination. For notational simplicity, we will from now on suppress the redshift argument from 
all quantities. The field 5 M (x) denotes the linear density contrast smoothed with a top-hat filter 
of radius R M = (3M/47rp m ) 1//3 . In Fourier space, 

6 M (k) = W M (k)5 k , (2.4) 

where W M {k) is the Fourier transform of the top-hat filter, 

rxr n \ o sin(kR M ) - kR M cos(kR M ) 

W M (k) = 3 ^3 . (2.5) 

We also define a M = {5 2 M ) 1 / 2 and u M (k) = W M (k)a(k). 

The main quantity of interest, in this paper, is the halo density contrast in Lagrangian space 

_ n h (x) - (n h ) 
5 h (x) = — , (2.6) 

where n^x) is the halo number density. To lowest order, 5h is related to the halo overdensity 
in Eulerian space via 5^ = 5^ + 5. We will determine the large-scale behavior of the matter-halo 
and halo-halo power spectra -P m h(&) = (S6h)(k) and -Phh(^) = (<^A)(^)- We define Phh(k) to be 
the halo power spectrum after the shot noise contribution 1 /n^ has been subtracted, where is 
the halo number density. Analogously, we define P m h(k) to be the matter-halo power spectrum 
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after subtracting the 1-halo term (in practice, this term is usually negligibly small). We define 
the (Lagrangian) halo bias as 

m - • (2.7) 

This is related to the Eulerian bias via b E = b + 1 . A stochastic form of halo bias arises whenever 
the density of halos isn't 100% correlated with the dark matter density [4]. In that case, the bias 
inferred from Phh will be different from the bias inferred from P m h, i.e. 



Pmm\k) v mm(^) 

3 A Series Representation of Barrier Crossing 

In this section, we introduce the barrier crossing formalism and quote results from our companion 
paper [4] . We also introduce a series representation of barrier crossing, which will be our main 
tool to prove the equivalence to local biasing and peak-background split in Section 4. 

3.1 Review of Barrier Crossing 

In the simplest version of the barrier crossing model [6], halos of mass > M are modeled as 
regions of space in which the smoothed density field 5 M exceeds the collapse threshold 5 C « 1.4, 
i.e. the halo number density n^ix) is given by 

K w (x) oc e(S M (x) - 5 C ) , (3.1) 

where is the Heaviside step function. Eq. (3.1) models the abundance of a mass-weighted 
sample of halos whose mass exceeds some minimum value M. 1 We will also consider the case of 
a halo sample defined by a narrow mass bin, which is obtained from the mass-weighted case by 
differentiating with respect to M, i.e. 

<(x)<x-^e(6 u (x)-8 e ) . (3.2) 

Throughout the paper, we will refer to these two types of halo samples as "mass-weighted sam- 
ples" (MW) and "narrow samples" (N). 

The barrier crossing model allows us to compute the statistics of halo-halo and halo-matter 
correlations. To discuss correlations between quantities at two points x and x', it is useful to 
define 5 M = 5 M (x),5' M = S M (x') and r = \x — x'\. The joint cumulants of the density fields are 
then 2 

^n(r,M) - (3.3) 

u u M 

Km , n (r,M,M) ^ {{SM) Z i5 * )n)c ■ (3.4) 

°M° ' M 

1 This type of sample is often assumed when fitting models to observations of luminous tracers such as galaxies 
or quasars. In the absence of detailed knowledge of the halo occupation distribution (HOD), a simple choice is to 
assume that halos below some minimum mass M are unpopulated with tracers, whereas the expected number of 
tracers in a halo of mass > M is proportional to the halo mass. 

2 Note that the variance of the unsmoothed linear density contrast a 2 — {S 2 } is formally infinite, but cancels in 
the definition (3.5) of the quantity /j n which will appear in our final expressions. 
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The hat on K^ n denotes the use of the unsmoothed density field 5. In the limit k — > 0, we 
find kj-,(A;) — > P m m(k)/(cra M ) and ki,i(/c) — )■ -P m m(^)/(cAff m)- This motivates the following 
definitions 

k« Af) 

/!,»(*, M, M) S Kl i k ,fa, ] <<* «>1. (3.6) 

/ m ,n(fc,M,M)E K ri^; M) form,n>2. (3.7) 

M, M)a M crM 

Using the function a(k, z) defined in (2.2), it is straightforward to relate the above cumulants to 
the primordial correlation functions £^ defined in (2.1). 

In [4] , we showed how the matter-halo and halo- halo power spectra are computed in the barrier 
crossing model using the Edgeworth expansion for the joint probability density function p(5 M , S' M ). 
(We refer the reader to that paper for detailed derivations and further discussion.) The result 
can be expressed in terms of the cumulants /j n and f m , n - Taking the limit k — > for the case of 
a mass-weighted sample with M = M, we find 



P mh (k, M) = P mm (k) b™ w (M) + a n (M)f i n (k, M) , (3.8a) 

V n>2 / 

P hh (k,M) = P mm (k)(b A g IW (M) 2 + 2b AIW (M)Y,a n (M)f 1:n (k,M,M) (3.8b) 

V n>2 

+ a m (M)a n (M)f m>n (k,M,MU , 

m,n>2 / 

where the coefficients a n (not to be confused with the a of eq. (2.2)) are defined in terms of 
Hermite polynomials (see Appendix A), 

*n(M) = J* , with v e (M) = ±. (3.9) 

V vr e rfc(^z/ c ) n\ a M 

We also defined the Gaussian bias as 

= . (3.10) 

Note that b™ w (M) is the Press-Schechter bias for the mass- weighted halo sample. In writing (3.8), 
we have dropped "nonlinear" terms in the Edgeworth expansion, i.e. terms involving products 

( K mini K m2n2 ■ ■ ■ K mp n p ) with p > 1. 
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Similarly, for the case of a halo sample defined by a narrow mass bin, we have 

P mh (k,M) = P mni (k) ( b N g (M) +J2^n(M)f ln (k,M) J , (3.11a) 

P hh (k,M) = P ima (k)lb^M) 2 +2b^(M)y^V n (M)f hn (k 1 M,M) _ (3.11b) 

\ y y ^-^ M=M 

\ n>2 

+ V m (M)V n (M)f m , n (k,M,M) 



M=M ) 

m,n>2 / 

where we have defined the differential operator 

V n {M) = (3 n {M) + /3„(M)— — , (3.12) 

a in a M 

as well as the functions 



'" X ' " i — . ,, M ...... , ..... „ , M ... , ..... 

tr M v c n\ n\u c 



b N g (M)^—^— , ^(M) = -^ and /3 TO (M) = ^ L; . (3.13) 



Note that 6^ (M) is the Press-Schechter bias of a halo sample defined by a narrow mass bin. In 
eq. (3.11b) for P^, we have assumed M = M for simplicity, but the variables M and M should 
be treated as independent for purposes of taking derivatives. 

3.2 Hermite Polynomial Expansion 

In this section, we will develop an alternative (to the Edgeworth expansion) algebraic framework 
for analyzing clustering in the barrier crossing model. First, consider the case of a mass- weighted 
halo sample, where the halo field is modeled as a step function 

K w (x) oc e (v(x) - v c ) , where v(x) = . (3.14) 

OA/ 

Since the Hermite polynomials H n (v) are a complete basis, any function of v can be written 
as a linear combination of Hermite polynomials. In particular, we can write the Heaviside step 
function Q(v — v c ) as 

oo 

Q(y -y c ) = y^ a n (u c ) H n {u) , (3.15) 

n=0 

where 

1 f 00 e~ v2 / 2 [ krfc(4=i/ c ) n = 



71 ' 

" v n! ^2tt' 

Plugging this series expansion into eq. (3.14), and normalizing the halo field to the fractional 



overdensity 5^, we get 



^ a (v c ) V °u 



b™ w (M)5 M (x) + ^2a n (M)p n (x) , [mass-weighted sample] (3.17) 



n>2 
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where a n (M) and b 
are denned as 



™ W (M) were introduced in eqs. (3.9) and (3.10), respectively. The fields p n 



x 



H r , 



(3.18) 



On large scales, the field p2 = tracks long- wavelength variations in the locally measured 

small-scale power, and for non-Gaussian initial conditions the power spectrum P P2 p 2 (k) may 
acquire extra large-scale contributions. Analogously, the field p3 = 8%,/a^ M — 35 M /a M tracks 
long- wavelength variations in the locally measured small-scale skewness, and so on for higher p n . 



[h~ 3 Mpc 3 ] 




k [hMpcr 1 ] 

Figure 1: Convergence of the series representation (3.17) at low k, illustrated by comparing terms in the 
halo-halo power spectrum Phh(k) = bg(M)P mm (fc) + a n(M)Pp n p Tl (fc) in a Gaussian cosmology. (Note 

that for Gaussian initiai conditions, cross power spectra P PmPn (k) with m 7^ n are zero.) We have taken 
2 = and a mass-weighted sample of halos with mass M > 2 X 10 13 /i" 1 Mq. 

We emphasize that the series representation (3.17) is mathematically equivalent to the barrier 
crossing model, since it is obtained by simply substituting the convergent Hermite series (3.15) 
into the barrier crossing expression (3.14) for n n . The series representation converges for all 
values of x, but its usefulness depends on how rapidly it converges, i.e. how many terms we need 
to get a good approximation. For example, to compute the halo field 5 n (x) at a single point x in 
real space, many terms are needed (of order 100) and the series representation is not useful. On 
the other hand, the Fourier transformed series representation 5 n (k) = b™ w 8 M + Yl n < =2 a nPn(k) 
converges rapidly on large scales (i.e. k -C fc n i), as shown in fig. 1, and the series representation 
is very convenient. (The series converges for all k, but only converges rapidly for k <C k n \.) 

The preceding expressions have all applied to the case of a mass-weighted halo sample. For 
the case of a halo sample defined by a narrow mass bin, the halo field is modeled as 



(x) oc 



d f 5 M (x) 



din a M 
d 

d In a M 



El \ZJ ( &m( x ) 
a n (v c )H n \ 



n>0 



y~] ( (n + l)v c a n+1 (v c ) + a n {v c )— 9 — J 

n>0 



Hr, 



S M {x) 



(3.19) 
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Normalizing to the fractional halo overdensity 5^, we get 

a n+ i(u c ) a n {u c ) d \ ( 5 M (x) 



n>l v v 



We drop the term containing d5 M /dlna M , since this term vanishes on large scales, k <C i?^ 1 , 
and write the result using the notation b g ,(3 n ,f3 n defined in eq. (3.13): 

S h (x) = b N g {M)5 M {x) + \Pn{M) + ^(M)^— J Pn (x) . [narrow sample] (3.21) 

As a check on our formalism, we can verify that the matter-halo and halo-halo power spectra ob- 
tained from the series (3.17) agree with the results obtained previously in [4] using the Edgeworth 
expansion. We first write the power spectrum Ps Pn (k) in terms of the correlation function (2.1), 

Pspn(k) = ^ m / a M (-\k + q\) x ei n+1) (fe,Qi,--- ,q n -i,-k-q) , (3.22) 

°M \ i=l Jq t J 

where we have defined J q (•) = J (•) and q = YH=\ Qi- Similarly, we can express P PmPn (k) 



as 3 



t m—l „ \ I n— 1 



m m Vi=i / \j=i ■ / 9j / 

x ^T +n) (g'i, • • • , q' m -i, ~q + k,q 1 ,--- , g n _i, -q-k) , (3.23) 

where q' = 9i ■ Using the notation /j and / min defined in eqs. (3.5) and (3.7), and taking 

the limit k — > 0, we find 

p Spn(k) = f hn (k,M)P mm (k) , (3.24) 
P PmPn (k) = f m , n (k, M, M) P mra (k) . (3.25) 



3 We have made an approximation here: by using connected correlation functions in eqs. (3.22) and (3.23), 
we have neglected some contributions to the power spectra Ps Pn and P PmPn ■ More precisely, we have neglected 
disconnected terms whose factorization contains multiple higher cumulants (i.e. K m ,„ with m + n > 3), and also 
some contributions to P PmPn (k) which approach a constant as k — > 0. (Note that subleading terms in the Hermite 
polynomial p n = (Sm/cm)" — n(n — 1)(5 M /o- M ) n ~ 2 /2 + ■ • ■ cancel the largest disconnected contributions to the 
power spectra in eqs. (3.22) and (3.23).) The derivation in [4] of eq. (3.8) contains equivalent approximations, which 
is why we will shortly find agreement with the results of [4]. In principle, one can avoid making any approximations 
by including disconnected contributions when calculating power spectra Ps Pn and P PmPn . However, in Appendix A 
of [4] , we showed that these approximations are always valid in the observationally relevant regime where the initial 
perturbations are close to Gaussian. 
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For the case of the mass-weighted halo sample, the series representation (3.17) therefore gives 
the following matter-halo and halo-halo power spectra 

P mh (k,M) = P mm (k) ^a n (M)/ ijn (fc,M)j , (3.26a) 
P hh (k,M,M) = P mm (k) I a m (M)a n (M)f m , n (k,M,M)) , (3.26b) 

\m,n>l J 

in agreement with the Edgeworth calculation (3.8). The case of the narrow mass bin can be 
verified similarly. 

Eqs. (3.17) and (3.21) are the main results of this section and give a series representation for the 
halo field in the barrier crossing model, for the cases of a mass- weighted halo sample and a narrow 
mass bin respectively. Using the series representation, we will give a simple, conceptual proof of 
the equivalence of the barrier model, the peak-background split, and local biasing in Section 4. 
However, it is useful to first build intuition by considering a few example non-Gaussian models. 

3.3 Examples 

For a given non-Gaussian model, one can analyze large-scale clustering by keeping a small set 
of terms in the series expansion of 5^ (either eq. (3.17) or (3.21) for a mass- weighted sample or 
narrow mass bin, respectively), and computing the necessary power spectra P PmPn {k) on large 
scales. This is a computationally convenient way to compute the non-Gaussian clustering signal, 
and allows the signal to be interpreted physically as arising from large-scale variations in locally 
measured quantities such as small-scale power and skewness, as we will see in the context of some 
example models. 

3.3.1 tnl Cosmology 

Consider a non-Gaussian model in which the initial Newtonian potential is given by 

<$>{x) = <l ) {x) + U(4> 2 {x) -{<!?)) , (3.27) 

where (j) is a Gaussian field. We will refer to this as the "/nl model" (or local model). This 
type of non-Gaussianity arises somewhat generically in multi-field models of the early universe, 
e.g. modulated reheating models [16], curvaton models [17, 18], or multi- field ekpyrotic scenar- 
ios [19, 20]. In this section, we will study a generalization of the / NL model which we will call the 
"t nl model". This type of non-Gaussianity arises in "multi-source" models, i.e. models in which 
quantum mechanical perturbations in multiple fields determine the initial adiabatic curvature 
perturbation [3, 21-23]. The non-Gaussian potential <3? is given in terms of two uncorrelated 
Gaussian fields (j) and ip, with power spectra that are proportional to each other 

$(a;) = </>(x) + i;(x) + / NL (1 + n) 2 (V 2 (*) - <V 2 » , (3.28) 

where / NL and II = P ( p(k) / P^(k) are free parameters. It is easy to compute the three- and 
four-point functions, 

d 3) = /nl [P1F2 + 5 perms.] + O(f^) , (3.29) 
4 4) = 2 (t) 2 r NL [PiP 2 Pi3 + 23 perms.] + 0(r 2 L ) , (3.30) 
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where we have defined r NL = (|/ N l) 2 (1 + 11), P, = P$(ki), and Py = P$(|fei + kj\). It is conven- 
tional to parametrize this model with variables {/nl,t nl }, which correspond to the amplitudes 
of the 3-point and 4-point functions, rather than the variables {/ NL ,n}. The / NL model (with 
IT = so that ip contributes but not cj)) corresponds to the special case r NL = (|/ N l) 2 - 

To compute halo clustering in the r NL model, we keep the first two terms in the series expansion 
for 5h (eqs. (3.17) and (3.21)), obtaining: 

bg IW S M + (X2P2 , [mass-weighted sample] 

&h={ ( ~ „ x (3-31) 

b N g 8 M + \fa{M) + HM)-^^) P2 . [narrow sample] 

Using eqs. (3.29) and (3.30) in eqs. (3.22) and (3.23), we obtain the following power spectra in 
the k — > limit: 

Ps P2 (k) = 4/ NL %^ , (3.32) 



Pp2P2(k) — 16 (|) T NL a 2ffc\ ■ (3.33) 



Putting everything together, we find 



Pmh(fc) = (b g + / NL ^j P mm (£;) , (3.34a) 
PhhW= ^ + 26 9 / NL A + (| ) 2 TNL _^_^p mm(A;) . (3 . 34b) 

where we have defined the non-Gaussian bias parameter 

p I 4«2(M) , [mass- weighted sample] ,^ 

\ 4/32 (M) • [narrow sample] 

In both the mass-weighted and narrow mass bin cases, the non-Gaussian and Gaussian parts of 
the bias are related by /3f = 25 c b g . Note that in the narrow mass bin case, there is a derivative 
term in 8^ (the term dp2/d\na M in eq. (3.31)), but this ends up giving zero contribution to the 
power spectra P m h and Phh, since the power spectra P$ P2 and P P2P2 are independent of M in the 
t nl model. 

Our calculation of the clustering power spectra (3.34) agrees with previous calculations in the 
literature (e.g. [3, 4]) but the series representation gives some physical intuition: the large-scale 
non-Gaussian clustering is due to large-scale fluctuations in the field p2, which we interpret as 
long-wavelength variations in the locally measured small-scale power. If r NL = (|/nl) 2 > then 
long-wavelength variations in p2 are 100% correlated to the matter density 5 on large scales, and 
the non-Gaussian halo bias is non-stochastic. If r NL > (|/ N l) 2 > then p2 and 5 are not 100% 
correlated, leading to stochastic bias. 

3.3.2 #nl Cosmology 

The g NL model is a non-Gaussian model in which the initial potential $ is given in terms of a 
single Gaussian field <j) by: 

*(as) = <f)(x) + g NL (<j ) 3 (x)-3(<j ) 2 )<j ) (x)) . (3.36) 
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+ ( (M) + #j (M) ) p 3 • [narrow sample] 



We keep the first three terms in the series expansion for <5 n , obtaining: 

bf w 5 M + CK2P2 + «3P3 j [mass-weighted sample] 

6-5 M + (&(M) + /9 2 (M)^) p 2 (3-37) 

^ 9 In o-jv/ 

To compute power spectra we will need the following cumulants in the <7 NL model: 

d 4)[trC ° ] = 5nl [P1P2P3 + 23 perms.] + 0(g 2 NL ) , (3.38) 
e (4)[loo P ] = 95 2 l [ Pl p 2 p^ (A . 13 ) + n perms .] ( (3.39) 

4 6) = 36 5nl [P1P2P3P4P125 + 89 perms.] . (3.40) 
Here, we have defined Pijk = P<f>{\ki + kj + fe&|) and 

i>(*0 ee 2 / P4q)P4\k - q\) ~ 4A* hi{kL)P^{k) , (3.41) 

where A 2 , = (A; 3 /2n 2 )P 4> (k) and we have regulated the infrared divergence by putting the field 
in a finite box of size L. Note that the power spectra Ps P2 and P P2P3 are zero (since there is a 
$ —> — symmetry). The remaining power spectra can be calculated by substituting eqs. (3.38), 
(3.39) and (3.40) into eqs. (3.22) and (3.23). In the limit k -> 0, this gives 

Pmm(k) J/ NL =i) 
a(/c) 



iWfc) = 3 9 n,^F4 ;N1= J , (3-42) 



P P 2pM = ^ ( / al(q)P$(q)) + 36(&iV(*) , (3.43) 



2 -Pmm(fc) / (/ NI =1)\ 2 

' a 2 (fc) V 3 ) 



W*) = ^L 1 ^ (4 /NL=iJ J • (3-44) 



Here, Kg ~~ denotes the dimensionless skewness parameter K3 = (5f I (x)} c /af I in the local 
model with / NL = 1. Note that we use the tree- level cumulant tree ^ when computing Ps P3 , 
but use both the tree-level cumulant and the one- loop cumulant 4 4)[1 °° p] when computing P p2P2 . 
Although the 0(g 2 SL ) one- loop cumulant is generally smaller than the 0(g NL ) tree-level cumulant, 
the one- loop cumulant dominates in the \ki + k%\ — > limit which is relevant for P P2P2 - 
Putting the above calculations together, we find: 4 



PnM - {■• « ( ^) P ^Jk) ■ (3. Joa) 



Phh(k) = (b g + <? NL ^§y) Pmm(k) + \fj^P^{k) , (3.45b) 

where fit was defined in eq. (3.35) and we have defined 

[ 3a3(M)K^/ Nh ~^ , [mass-weighted sample] 
'•/ <!../,.,,. ; „^ a V.(/nl=i) r , (3-46) 



3 MM) + p2{M) m £--) k^-'> . [narrow sample] 



4 We have neglected contributions to P^h(k) which approach a constant as k — > 0; such contributions are 
unobservable in practice since they are degenerate with other contributions such as second-order halo bias. 
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Note that in the narrow mass bin case, there are derivative terms in 5^ (eq. (3.37)), and their 
contributions to P m h and Phh are non-zero (unlike the previously considered r NL model) , because 
the power spectra P PmP „ in eqs. (3.42)-(3.44) depend on halo mass via the mass-dependent 
quantity 4 /nl=1) . 

These expressions for P m h and Phh agree with previous calculations in the literature based 
on the Edgeworth expansion [4, 5, 8]. Our series expansion gives some physical intuition as 
follows. The non-Gaussian contribution to P m h comes from the power spectrum P$ pa , and can 
therefore be interpreted as arising from long- wavelength variations in the locally measured small- 
scale skewness P3. On large scales, the non-Gaussian fluctuations in p^ are 100% correlated to the 
density field, and therefore the associated halo bias is non-stochastic. The leading contribution to 
stochastic bias comes from the power spectrum P P2P2 and can be interpreted as long-wavelength 
variations in small-scale power which are uncorrelated to the density field. 

4 Proof of the Equivalence 

In the previous section, we showed that the barrier crossing model can be formulated as a series 
representation: 



In this section, we will use this result to prove that barrier crossing is mathematically equivalent 
to local biasing (§4.1) and peak-background split (§4.2). 

4.1 Local Biasing 

"Local biasing" refers to any model of halo clustering in which the halo field is represented as a 
local function of the dark matter density, e.g. a power series 



Several versions of local biasing exist in the literature (e.g. [11-14]). We notice that the series 
on the right-hand side of (4.1) is a type of local biasing expansion, since the p n fields are local 
functions of the smoothed density field 5 M . Therefore, our series representation proves that 
the barrier crossing model is mathematically equivalent to a specific version of the local biasing 
formalism. In this section, we would like to elaborate on the connection between our series 
representation and the usual way of thinking about local biasing, and comment on the differences 
with other versions of the formalism. 

First, the density field 5 M which appears in the series representation is the non-Gaussian and 
linearly evolved density field, smoothed on the mass scale M. In particular, there is no need to 
introduce a new smoothing scale which is distinct from the halo scale, as done in some versions 
of local biasing. We do not include non-linear evolution in 5 M since the standard barrier crossing 
model is based on thresholding the linear density field. 

Second, we do not need to introduce explicit dependence of the halo over-density 6^ on the 
long- wavelength potential ^ in a non-Gaussian cosmology. In some versions of local biasing, 6^ is 




[mass- weighted sample] 
[narrow sample] 
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expanded in both 8% and in order to keep the relation local. In our version, the <3?^ dependence 
happens automatically, since 5^ depends on higher cumulants p2, ps, ■ ■ ■ , and these cumulants can 
be correlated with in a non-Gaussian model. To see how this happens in detail, consider the 
/ NL model. Inspection of the power spectra in eqs. (3.32) and (3.33) shows (taking r NL = (§/nl) 2 ) 
that p2 is 100% correlated with the field <3?£ = (k)S M as k — > 0. More precisely, p2 — > 4/ NL <J>^ 
on large scales. Making this substitution in eq. (4.1), we get 5h = b g 5i + f NL /3f^i + - ■ ■ and recover 
the usual result. 

This example shows that including explicit &e dependence in the local expansion of 5^ is not 
necessary (in fact, including it our model would "double-count" the non-Gaussian clustering), if 
higher powers of the density field are included in the expansion. In the / NL model, the modulation 
to the locally measured power p2 is directly proportional to <I>^. More generally, the expansion 
should be in all of the non-negligible cumulants p2, P3, • • • ■ 

It is also interesting to consider the r NL model in the case r NL > (|/nl) 2 - Here, the locally 
measured small-scale power p2 has excess power on large scales which is not 100% correlated with 
leading to stochastic bias [4]. This qualitative behavior is correctly captured by a local biasing 
model of the form 5h = b g 5e + 02^2, but not by a local biasing model of the form 5^ = b g 5i + 62^- 

In the narrow mass bin case, our series expansion includes derivative terms of the form 
dp n /dlna M . To our knowledge, derivative terms have not been been proposed in any version 
of local biasing which has appeared in the literature. In the barrier crossing model, derivative 
terms appear naturally for a narrowly selected halo sample, since this case is obtained from the 
mass-weighted case (which does not contain derivative terms) by differentiating with respect to 
halo mass. 

Finally, even in the mass-weighted case, there is a difference between the Hermite polynomial 
expansion 

5 h (x) = b™ w 5 M (x) + V a n {M)H n ( 5 J^1\ (4.3) 

n>2 V °M J 

and a power series expansion of the form 

S h (x) = hS M (x) + b 2 5 2 M (x) + b 3 S 3 M (x) + ■■■ . (4.4) 

At first sight, the two may appear equivalent: if both series are truncated at the same order 
N, then we can rearrange coefficients to transform either series into the other (since both just 
parametrize an arbitrary degree- N polynomial). However, when we write the power series expan- 
sion (4.4), we are assuming that the values of the low-order coefficients &i, i>2> ■ ■ ■ are independent 
of the order iV at which the series is truncated. This means for example that in a Gaussian cos- 
mology, the matter-halo power spectrum P m h(k) = (bi + 30^63 + 150^65 + • • • )P mm (k) depends 
on where the series is truncated. In contrast, the Hermite expansion (4.3) is more stable: -P m h(&) 
is always equal to bg IW P mm (k), regardless of how many terms are retained in the series. Note that 
the barrier crossing model has a convergent Hermite polynomial expansion (3.15), but cannot be 
sensibly expanded as a power series in 5 M , since the Heaviside step function Q(5 m /^m — S c ) is 
not an analytic function of 5 M . 

In summary, the barrier crossing model is mathematically equivalent to a specific version of 
the local biasing formalism in which the following choices have been made: we linearly evolve 
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the density field and smooth it at mass scale M; we include higher cumulants P2,P3,'" in the 
density field, but not additional fields such as the potential <&i; derivative terms appear in the 
narrow mass bin case; and we use a Hermite polynomial expansion in 8 M /a M rather than the 
power series expansion. Other variants of the local biasing formalism exist in the literature, and 
we are not claiming that our choices are optimal (in the sense of producing best agreement with 
simulations) ; the purpose of this section was simply to point out which set of choices is equivalent 
to the barrier crossing model. 

4.2 Peak-Background Split 

The "peak-background split" is a formalism for modeling halo clustering on large scales, in which 
one relates large-scale modes of the halo density field #h to large-scale modes of fields whose 
power spectra can be calculated directly. For example, the PBS formalism was applied to an / NL 
cosmology in [24]. On large scales, k <C i?^ 1 ) one can argue that the halo density is related to 
the linear density field 5 and the Newtonian potential $ by 



where b g is the usual Gaussian bias, and f3f = 231nnh/<91n<Ts. Using this expression, it is easy 
to show that the large-scale bias is given by b(k) = b g + f NL [3f /a(k), and is non-stochastic. For 
additional examples of the PBS formalism applied to non-Gaussian models, see [3, 4, 8]. In this 
section, we will show how the PBS formalism generalizes to an arbitrary non-Gaussian model, 
and give a simple proof that this generalization is equivalent to the barrier crossing model. We 
will work out in detail the case of a mass- weighted halo sample; the narrow mass bin case follows 
by differentiating with respect to M. 

There is one technical point that we would like to make explicit. We want to generalize 
the peak-background split formalism so that it applies to an arbitrary non-Gaussian model, 
parametrized by the iV-point correlation functions of the initial Newtonian potential <3?. As an 
example, consider the t nl model from §3.3.1, with constituent fields eft, ip. The PBS analysis of 
this model has been worked out in [3, 4] and requires keeping track of the long- wavelength parts 
(f>l,ipt of both fields, in order to correctly predict non-Gaussian stochastic bias on large scales. 
(Intuitively, multiple fields are needed because we need to keep track of long-wavelength density 
fluctuations and long-wavelength variations in the locally measured small-scale power, and the 
two are not 100% correlated in the r NL model.) This raises a conceptual puzzle: how would we 
get stochastic bias if we were just given correlation functions of the single field <£, rather than 
a description of the r NL model involving multiple constituent fields? As we will now see, we 
must extend the PBS formalism by introducing additional fields which correspond to the locally 
measured small-scale power, small-scale skewness, kurtosis, etc. These fields are precisely the 
quantities p2,P3,- •■ which appeared earlier in our series expansion in §3. This will allow us to 
connect the PBS formalism with the barrier crossing model (and in fact prove that the two are 
mathematically equivalent). 

Consider a large subvolume of the universe containing many halos, but over which the long 
mode is reasonably constant, and let denote a spatial average over the subvolume. Let us 
assume that the halo number density (n^je in the subvolume is a function of the one-point PDF 




(4.5) 
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of the underlying dark matter field 5 M (when linearly evolved and smoothed on the halo scale). 
For weakly non- Gaussian fields, the one-point PDF in each subvolume can be characterized 
completely by its mean (5 M )e, variance {<J 2 M )i, and higher cumulants (K n )e = ((C)c/ ff S{)< f° r 
n > 3. Therefore we can write (nh)^ = nh((8 M )z, (cr^)^, {(k„)4). Taylor expanding to first order 
in these parameters, we get 

/ \ - (, , <91nn h <91nn h . 2 2 \ , \^ <91nra h \ 

(n h )i = n h 1 + , , (<W)l + , 2 , (K, )<? - <7 M j + > — r • (4.6) 

Here, we have used the notation (cr^)^ to denote the variance of 5 M restricted to the subvolume, 
and o~ 2 M to denote the global variance. To make contact with our previous notation, note that 
((°m)^ ~~ °Ar) = °~M{P2)e and (« n )^ = (Pn)i- 5 Making these substitutions in eq. (4.6), we get 

<91nn h 2 <91nn h ^<91nn h 

Since this equation applies when taking the subvolume average (-)^ over any large subvolume, it 
also applies to any large-scale Fourier mode: 

c ,,s fe^o <91nn h 2 <91nn h ^<9hm h 

(5 h (fe) ► 5 M (fc) + <__ /02 (fe) + 2^^— Pn (fc) . (4.8) 

M m n=3 n 

Let us compare this expression with our series representation of 5^ in the barrier crossing model: 

5 h (k) = b™ w 8 M {k) + Y, otn{M) Pn {k) . (4.9) 

n>2 

The form of the two series representations is the same, but the coefficients appear to be different. 
In the barrier crossing model, we have the following explicit formula for the coefficient a n (M) of 
the n-th term in the series: 

a n {M) = W — - , (4.10) 

V vr er f c (^=^ c ) nl 

whereas in the PBS derivation, a n is given by a suitable derivative of the halo mass function: 

2 dmra h <91nn h , . 

ct2 = cr M and a n = — for n > 3 . (4.11) 

c/<7 M a«; n 

If we assume a Press-Schechter mass function, then one can evaluate the mass function derivatives 
in the above equation using the machinery from [7]. The result agrees precisely with the explicit 
formula (4.10). Therefore, the barrier crossing model and the generalized PBS formalism with 
fields p2, P3, • • • are formally equivalent, but only under the assumption of a Press-Schechter mass 
function (note that this assumption is "built in" to the barrier crossing model). 

If we relax the assumption of a Press-Schechter mass function, then the barrier crossing model 
and the generalized PBS formalism can both be written as series expansions with the same general 



5 The identity (p n )e = {Kn)e holds for n < 5, but has non-linear corrections for n > 6. For example, (pe)e = 
{K(i)e + 10(ft3)f . We have neglected these non-linear corrections since eq. (4.6) is an expansion to first order anyway. 
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form, but make different predictions for the coefficients a n (M). One can ask which prediction 
agrees better with iV-body simulations. In [8], the two predictions for a 3 were compared with 
simulations in the context of the g NL model. It was found that the PBS prediction (4.11) is exact 
(within the ~ 1% statistical error of the simulations) if both the bias and the mass function 
derivative (dlnn^/dK^) are evaluated numerically from the simulations. The barrier crossing 
prediction (4.10) is an approximation: although it is based on an exact calculation within the 
barrier crossing model, this model is an approximation to the true dynamics of an iV-body 
simulation. The approximation works reasonably well for large halo mass but breaks down for 
low masses, motivating the use of fitting functions for practical data analysis. It is natural to 
conjecture that the same qualitative statements will be true for the a n coefficients with n > 3, 
but we have not attempted to verify this with simulations. (Note that no fitting function is 
necessary for 02, since the relation (3f « 25 c b g holds to « 10% accuracy in iV-body simulations.) 

In summary, the barrier crossing model is mathematically equivalent to the PBS formalism, 
appropriately generalized to an arbitrary non-Gaussian cosmology by introducing additional fields 
P2, P3i • • • , plus the additional assumption of a Press-Schechter mass function. The barrier crossing 
model is analytically tractable (e.g. one can derive closed-form expressions for the coefficients 
a n {M) and f3 n (M)), and usually a reasonable approximation, making it very useful for analytic 
studies or forecasts. However, for data analysis, it may be necessary to go beyond the Press- 
Schechter approximation by replacing the closed- form expressions for coefficients such as a n (M) 
with their PBS counterparts measured from simulations. 

5 Conclusions 

In this paper, we have proven the mathematical equivalence of barrier crossing, peak-background 
split and local biasing. We first introduced a Hermite polynomial expansion of the halo density 
contrast <5h in the barrier crossing model: eqs. (3.17) and (3.21). We showed that this allows a 
computationally efficient way to calculate the clustering power spectra P m h and Phh- Moreover, 
the series expansion makes the formal equivalence of the various halo modeling formalisms very 
transparent. First, it automatically takes the form of a local biasing model, in which the non- 
Gaussian and linearly evolved density contrast is expanded in Hermite polynomials. Second, it 
provides a very natural connection between barrier crossing and peak-background split. To make 
this relationship manifest, we generalized the PBS formalism so that it can be applied to the 
most general set of non-Gaussian initial conditions, parametrized by the iV-point functions of the 
primordial potential. This extension of PBS involves additional fields which correspond to the 
locally measured small-scale power, small-scale skewness, kurtosis, etc. Mapping those fields to 
fields in the Hermite polynomial expansion of the barrier crossing model, we proved the mathe- 
matical equivalence between PBS and BC. Finally, although, in this paper, we have concentrated 
on computing power spectra, our series expansion should also be useful for analyzing the effects 
of primordial non-Gaussianity on other clustering statistics, such as the halo bispectrum [14]. 
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A Hermite Polynomials 

In this paper, we have used the probabilists ' definition of Hermite Polynomials 



H n {y) = (-l)V 2 / 2 



u 2 /2 



(A.l) 



satisfying the recursion relation 



H n+l (v) = uH n {v) - H' n {u) 



(A.2) 



and the orthogonality condition 




(A.3) 



For reference, we list some of the low-order Hermite polynomials 



H (u 
H 2 {v 



= v 3 - 3i> , 

= u A - 6u 2 + 3 . 



= 1 , 



= v 



(A.4) 
(A.5) 
(A.6) 
(A.7) 
(A.8) 



We have made use of the following integral 



1 




1 



e-*l 2 H n {y) 




H n ^{u c ) n>l 



(A.9) 
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